CDS
Accession Number | TCMCG078C24192 |
gbkey | CDS |
Protein Id | KAG0492827.1 |
Location | join(39942383..39942949,39944619..39944804,39945348..39945563,39945692..39945802,39952102..39952299,39952375..39952506,39954104..39954160,39954273..39954389,39955144..39955221,39960439..39960582,39961273..39962316) |
Organism | Vanilla planifolia |
locus_tag | HPP92_006225 |
Protein
Length | 949aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA633886, BioSample:SAMN14973820 |
db_source | JADCNL010000002.1 |
Definition | hypothetical protein HPP92_006225 [Vanilla planifolia] |
Locus_tag | HPP92_006225 |
EGGNOG-MAPPER Annotation
COG_category | U |
Description | AP-4 complex subunit epsilon |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000, ko00001, ko04131 |
KEGG_ko | ko:K12400 |
EC | - |
KEGG_Pathway | ko04142, map04142 |
GOs | GO:0005575, GO:0005622, GO:0005623, GO:0005737, GO:0005829, GO:0005911, GO:0009506, GO:0016020, GO:0030054, GO:0030117, GO:0030119, GO:0030124, GO:0032991, GO:0044424, GO:0044425, GO:0044444, GO:0044464, GO:0048475, GO:0055044, GO:0098796 |
Sequence
CDS: ATGGGCTCCCAAGGCGGTTGGGGCCAGTCCAAGGAGTTCCTGGATCTGGTGAAGTCCATCGGCGAGGCCCGCTCCAAGGCGGAGGAGGACCGCATCGTTCTTCGCGAGATCGAGACTCTGAAGCGACGGATCGCGGAGCCAGACGTCACGCGACGCAAGATGAAGGAGTACATCGTACGTCTCGTCTATGTTGAGATGCTTGGCCATGATGCTTCCTTTGGGTATATTCATGCCGTGAAGATGACTCACGACGATAATGTTGTCCACAAACGTACTGGTTATCTTGCTGTGACGCTCTTCTTGAACGAGAATCACGATCTTATCATCCTCATTGTGAATACCATACAGAAGGATTTGAAGTCCGATAACTATTTGGTAGTTTCCGCTGCTCTGACGGCGGTGTGTAAGCTCATCAACGAGGAGACGATCCCAGCCGTGTTGCCACAGGTGGTGGAGCTCCTTGGGCATCCCAAGGAGGCTGTAAGGAAGAAGGCAGTCATGGCACTGCACCGGTTCTACCAGCGTTCACCAGCTTCAGTATCCCACCTCCTCTTACATTTCAGGAAGAGGCTTTGTGATGGTGATCCTGGAGTAATGGGTGCTGCACTATGTCCTATTTTTGATCTTATCACGGCTGATGTAAACTCATACAAGGATCTGGTTGTCAGTTTTGTGAGCATTCTTAAGCAAGTTGTTGAAAGAAGATTGCCCAAGTCATATGAATACCATCAAATGCCTGCTCCATTTCTTCAGGTTAAGTTACTTAAGATTCTTGCGTTGCTGGGTAGTGCGGATAAGCAAGCGAGTGGACACATGTACGCTGTACTGGGTGAGATATTTAGGAAGTGTGAAATGTCAAGCAACATTGGTAATGCTGTGCTCTATGAATGCATCTGCTGTGTCTCATCTATCCAGCCGAATACGAAGTTGCTAGATGCTGCTACTGAAGCAACTTCAAAATTTCTGAAGAGTGACAGTCATAATCTCAAATACATGGGAATTGATGCCCTTGGTCGACTGATTAAGATAAACCCTGATATTGCTGAGGATCATCAGCTGGCTGTTATTGATTGCTTGGAAGATTCTGATGATACTTTAAAGAGGAAGACCTTTGAGTTACTTTATAAAATGACAAAATCCACCAACGTTGAAGTCATAGTTGATCGGATGATTAATTATATGATTTCCATAAGCGATAAGCATTATAAAACTGAAATAGCATCACGTTGTGTTGAGCTTGCCGAACAATTTGCTCCAAGCAATCAATGGTTTATCCAGACTATGAATAAGATCTTTGAGCATGCTGGTGACGTAGTAAATGTCAAAGTGGCACACAATTTAATAAGGCTTATTGCTGAAGGATTTGGAGAAGATGACGACGGTGCAGATAGCCAATTAAGATCCTCAGCTGTTGACTCATATTTGCACATTCTTTCAGAACCAAAGCTTCCTTCCATTTTCTTGCAAGTCATATGCTGGGTGCTGGGAGAGTATGGTACCGCAGATGGGAAGTATTCTGCATCTTTTATTATTGGTAAAATTTGTGATGTTGCAGAGGCACATACAAATGACAGCACTGTTAAGGCTTATGCAATAACGAGTATCATGAAAGTTTGTGCATTTGAAATTGCTGCTGGAAGGAAGGTGGAAATGTTGCCCGAGTGTCAATCTTTAATCGACGAACTATTAGCTTCCCATTCAACTGATCTGCAGCAGCGTGCGTATGAGCTACAAGCTCTGTCGTGCTTGGATAGTCATGTTATTCAACATGTGATGCCCCCAGATGCTAGCTGCGAAGATGTTGAGGTTGATAAAACCTTGTCTTTCCTCGACGATTTTGTGCAAAAAGCGTTTGAGAAAGGCGCACAGCCCTACGTTCCTGAGAGCGAAAGGTCTGGTGTGTACGACATCAGCAGCTTTAGAAACCAATATCAACAAGAACAATCTGGGCATGGTCTCAGGTTTGAAGCTTATGAACTCCCCAAGTCCTTACCA
CCAACAAATATCCCCACAATTCTCCATCCCCTTCCATCCACCGATGTCGTCCCTGTTTTTGAACCATCACATTCACGACCAACCCATCAAACATCATCCGGTGTCGATGTTTCCTCGGACGTCGGAGTTAAGCTCAGGCTTGATGGTGTTCAGAGGAAGTGGGGCAGGCCAACCGAATTATCTTCTTCTTCAACCTCTGGCTCTGCAACTGAAAGTGCAGCAAATGGGTTCTCGCAATCTGATGGATGGAAGAATGCAGCTTCGCATCCCCGAGATTTCTCATCCGACAAGAGGAATCCACCACCGGTTGAAGTATCGACCGAGAAGCAGAGACTCGCTGCATCCCTGTTTGGTTCTTCAGCATCCAAATCAGAAAAGAAACCACCATCCACACGCAGCTCATCCAGGGCAAGTAATGCAAATTCTGTGAAGCCAACTTCTGCAACTCTTCCTACAGAACCTCCAAAGGAGAAAGGCTCTGCTTCCACACCTCCGCCTCCCGATTTACTTGACTTGGGCGAGTCATTTCCCTCGAGTCCTCCATCCGAAGACCCATTCAAGCAGCTTGAAGGGCTCATCGGACCAGACACTGCTCCCACCATCAATCCTCCTCTTACAGCAACCATTCCAAACACCGCAAACATCATCTCATCATACAGCGAAGCCACTTCGCCTGATCTCGGCACCGATTTCATTTCTCAATTCACCAAAACCTCTCACGGAGCTCATGGCATCAGCTCAGCAAAGAAGGGACCAAATTCTAGAGAAGCACTGGAGAAGGATGCCGTCGTAAGACACGTAGGTGTGACACCCACAGGTAATAATCCCAACCTGTTTAGAGATCTTCTGAGCTGA |
Protein: MGSQGGWGQSKEFLDLVKSIGEARSKAEEDRIVLREIETLKRRIAEPDVTRRKMKEYIVRLVYVEMLGHDASFGYIHAVKMTHDDNVVHKRTGYLAVTLFLNENHDLIILIVNTIQKDLKSDNYLVVSAALTAVCKLINEETIPAVLPQVVELLGHPKEAVRKKAVMALHRFYQRSPASVSHLLLHFRKRLCDGDPGVMGAALCPIFDLITADVNSYKDLVVSFVSILKQVVERRLPKSYEYHQMPAPFLQVKLLKILALLGSADKQASGHMYAVLGEIFRKCEMSSNIGNAVLYECICCVSSIQPNTKLLDAATEATSKFLKSDSHNLKYMGIDALGRLIKINPDIAEDHQLAVIDCLEDSDDTLKRKTFELLYKMTKSTNVEVIVDRMINYMISISDKHYKTEIASRCVELAEQFAPSNQWFIQTMNKIFEHAGDVVNVKVAHNLIRLIAEGFGEDDDGADSQLRSSAVDSYLHILSEPKLPSIFLQVICWVLGEYGTADGKYSASFIIGKICDVAEAHTNDSTVKAYAITSIMKVCAFEIAAGRKVEMLPECQSLIDELLASHSTDLQQRAYELQALSCLDSHVIQHVMPPDASCEDVEVDKTLSFLDDFVQKAFEKGAQPYVPESERSGVYDISSFRNQYQQEQSGHGLRFEAYELPKSLPPTNIPTILHPLPSTDVVPVFEPSHSRPTHQTSSGVDVSSDVGVKLRLDGVQRKWGRPTELSSSSTSGSATESAANGFSQSDGWKNAASHPRDFSSDKRNPPPVEVSTEKQRLAASLFGSSASKSEKKPPSTRSSSRASNANSVKPTSATLPTEPPKEKGSASTPPPPDLLDLGESFPSSPPSEDPFKQLEGLIGPDTAPTINPPLTATIPNTANIISSYSEATSPDLGTDFISQFTKTSHGAHGISSAKKGPNSREALEKDAVVRHVGVTPTGNNPNLFRDLLS |
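Consistency check (illustrative sketch, not part of the original record): the Location field lists 11 segments in a join(...) descriptor, and their combined length should equal three nucleotides per residue of the 949 aa protein plus one stop codon. The helper name `segment_lengths` is hypothetical, and the check assumes the coordinates are 1-based and inclusive, as is usual for this location syntax.

```python
import re

# Location string copied verbatim from the CDS record above.
LOCATION = (
    "join(39942383..39942949,39944619..39944804,39945348..39945563,"
    "39945692..39945802,39952102..39952299,39952375..39952506,"
    "39954104..39954160,39954273..39954389,39955144..39955221,"
    "39960439..39960582,39961273..39962316)"
)

# Protein length as reported in the Protein block (949aa).
PROTEIN_LENGTH = 949


def segment_lengths(location: str) -> list[int]:
    """Return the length of every start..end segment in a join() location.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pairs = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in pairs]


lengths = segment_lengths(LOCATION)
cds_length = sum(lengths)

print("segments:", len(lengths))
print("CDS length (nt):", cds_length)
# A complete CDS has one codon per residue plus a terminal stop codon.
print("matches protein length:", cds_length == 3 * (PROTEIN_LENGTH + 1))
```

If the final comparison is true, the exon coordinates, the nucleotide sequence length, and the reported protein length agree with one another.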